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Fuel  from  Self-Degrading  Bioengineered  Packaging 

Submitted  hy  Richard  A,  Gross 
Principle  Investigator 

Research  focused  on  developing  an  enzyme  that  efficiently  converts  plastics  to  products 
of  value  as  fuels  or  other  substances  that  would  otherwise  need  to  be  provided  to  troops 
in  the  field.  Polylactic  acid  (PLA)  was  selected  for  this  work  since  it  is  a  commercially 
available  biodegradable  plastic  with  useful  physical  attributes  both  in  fiber  and  film  form. 
For  troops  to  successfully  convert  PLA  to  useful  products,  an  enzyme  is  needed  that  can 
meet  the  requirements  below: 

>  Completely  convert  PLA  to  degradation  products  within  a  practical  time  period 
(i.e.  24  hours). 

>  Directly  convert  PLA  to  lactic  acid  for  use  in  fuel  cells. 

>  Increase  the  enzymes  tolerance  to  low  pH  environments  that  will  be 
encountered  in  non-buffered  bioreactors  during  PLA  degradation. 

>  Increase  the  activity  of  the  enzyme  to  function  at  lower  temperatures  thus 
enabling  the  development  of  conversion  processes  that  require  low  energy  input. 

The  above  goals  call  for  a  protein  engineering  strategy  that  can  rapidly  acquire  a  broad 
range  of  information  to  address  all  of  the  above  enzyme  requirements.  Furthermore, 
testing  of  engineered  enzymes  (variants)  for  all  these  activities  requires  a  new  approach. 
This  is  illustrated  by  the  need  to  identify  variants  that  produce  products  with  different 
structures.  Such  information  cannot  practically  be  attained  using  high-throughput 
methods.  Thus,  for  the  first  time  a  powerful  protein  engineering  technology  was  used  that 
generates  small  sets  (<50)  of  designed  variants  instead  of  the  1  000  to  10  000  sets 
required  by  conventional  directed  evolution  approaches.  The  protein  engineering 
technology  integrates  structural,  phylogenetic  and  statistical  sequence  information  to 
select  an  initial  set  of  function-modifying  substitutions.  No  one  piece  of  information 
(such  as  a  crystal  structure)  is  essential,  making  substitution  choice  more  robust  than  for 
any  other  method.  A  set  of  100  variants  was  designed,  each  incorporating  3-6  amino  acid 
substitutions  into  the  wild-type  sequence.  This  greatly  increased  the  information  content 
over  library-based  methods  and  eliminated  the  need  for  high-throughput  screening.  We 
were  thus  able  to  evaluate  the  clones  under  many  experimental  conditions  using  multiple 
replicates  and  methods  such  as  size-exclusion-chromatography  that  are  slow,  tedious  and 
impossible  to  convert  to  a  high-throughput  format.  Multiple  new  algorithms  for 
determining  sequence-activity  relationships  were  designed  and  used  to  construct  a  second 
set  of  15  variants  that  are  currently  under  study. 

The  results  from  the  seedling  project  have  been  a  resounding  validation  of  this  approach. 
The  accomplishments  from  just  one  cycle  of  variant  sequence  and  testing  are  summarized 
below.  If  not  otherwise  specified,  activities  are  expressed  relative  to  the  “wild  type” 
enzyme  from  the  fungus  Tritirachium  album  (heterologously  expressed  in  E.  coU).  The 
wild  type  enzyme  is  sample  "NS9". 
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Enzyme  engineering  goals 

The  goals  of  the  enzyme  engineering  part  of  this  project  are  to  produce  a  protease 
capable  of  degrading  a  useful  packaging  plastic,  to  produce  an  enzyme  that  yields  a 
specific  distribution  of  products  when  it  degrades  the  plastic,  to  improve  the  stability  of 
the  enzyme  to  higher  temperatures  required  for  plastic  manufacture  and  to  increase  the 
activity  of  the  enzyme  at  the  lower  pH  values  that  it  will  encounter  due  to  the  acid 
generated  during  plastic  degradation. 

We  set  out  to  accomplish  the  following  goals: 

1.  Construct  a  proteinase  K  that  is  well  expressed  and  easily  extractable  in  a  test 
production  system. 

2.  Engineer  proteinase  K  to  improve  its  ability  to  degrade  polylactic  acid. 

3.  Engineer  proteinase  K  to  improve  its  tolerance  to  acidity. 

3.  Engineer  proteinase  K  to  improve  its  thermostability. 

4.  Engineer  proteinase  K  to  control  the  PEA  oligomers  that  it  produces,  to  increase  the 
production  of  those  oligomers  that  have  enhanced  value  as  liquid  fuels. 

1.  Proteinase  K  production  system 

Escherichia  coli  is  a  convenient  host  for  protein  engineering:  it  grows  rapidly  and 
its  molecular  genetics  are  well  understood.  The  cDNA  and  the  chromosomal  gene 
encoding  proteinase  K  from  Tritirachium  album  Timber  have  previously  been  cloned  in  E 
coli.  The  protein  was  properly  secreted  into  the  periplasm  with  subsequent  processing  to 
the  mature  proteinase  (Gunkel  and  Gassen,  1989,  Eur.  J.  Biochem.  179:185).  As  is  often 
the  case  with  genetic  constructs  described  in  the  scientific  literature,  it  proved  impossible 
for  us  to  obtain  the  plasmid  constructed  by  Gunkel  and  Gassen.  We  also  wished  to  make 
some  improvements  to  the  published  system,  which  we  did  by  re-designing  and  then 
synthesizing  the  entire  gene. 

MKKL'LFAl^lNV^YYS'B.SJMAPAVEQRSEAAPLIEARGEMVANKYIVKFKEGSALSALDAAMEKISGKPDH 

VYKNVFSGFAATLDENMVRVLRAHPDVEYIEQDAWTINAAQJ'i^APVlG'LAPlSSTSPGTSTYYYPiESAGQGS 

CVYVIDTGIEASHPEFEGRAQMVKTYYYSSRDGNGHGTHCAGTVGSRTYGVAKKTQLFGVKVLDDNGSGQYS 

TIIAGMDFVASDKNNRNCPKGVVASLSLGGGYSSSVNSAAARLQSSGVMVAVAAGNNNADARNYSPASEPSV 

CTVGASDRYDRRSSFSNYGSVLDIFGPGTSILSTWIGGSTRSISGTSMATPHVAGLAAYLMTLGKTTAASAC 

RYIADTANKGDLSNIPFGTVNLLAYNNYQAHHHHHH 

Figure  L  Amino  acid  sequence  of  proteinase  K. 

The  redesigned  amino  acid  sequence  of  proteinase  K  is  shown  in  Eigure  1.  The 
first  18  amino  acids  (underlined)  are  an  E  coli  signal  peptide  derived  from  the  pBAD/glll 
vector,  followed  by  two  linker  amino  acids.  An  E  coli  signal  sequence  was  used  to 
improve  secretion  of  the  heterologous  protein.  The  Tritirachium  album  proteinase  K 
propeptide  is  italicized,  followed  by  the  sequence  of  the  mature  protein.  Einally  we 
added  a  histidine  tag  (underlined)  to  the  C-terminus  of  the  protein.  This  allows 
convenient  and  rapid  purification  of  the  proteinase  K  variants,  allowing  us  to  assess  their 
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activities  in  the  absenee  of  eontaminating  proteases  endogenous  to  E  coli  and  of  other 
proteins  present  in  the  E  coli  periplasm  whieh  eould  aet  as  eompetitive  inhibitors  of  the 
proteinase  K. 

To  further  improve  expression  of  proteinase  K,  we  redesigned  the  DNA  eneoding 
the  protein.  Different  organisms  preferentially  use  different  eodons  to  speeify  the  same 
amino  aeid.  These  differenees  are  partieularly  marked  in  E  eoli,  and  the  presenee  of 
eodons  that  are  not  generally  used  by  E  eoli  often  eauses  signifieant  reduetion  in 
expression  levels  of  heterologous  proteins.  We  therefore  used  eodons  at  the  frequeneies 
that  they  are  found  in  highly  expressed  E  eoli  genes,  and  used  no  eodon  that  was  used  less 
than  10%  of  the  time  in  these  genes.  The  redesigned  DNA  sequenee  for  proteinase  K 
from  the  alanine  at  position  20  to  the  alanine  preeeding  the  histidine  tag  is  shown  in 
Eigure  2. 

GCACCGGCCGTTGAACAGCGTTCTGAAGCAGCTCCTCTGATTGAGGCACGTGGTGAAATGGTAGCAAACAA 

GTACATCGTGAAGTTCAAGGAGGGTTCTGCTCTGTCTGCTCTGGATGCTGCTATGGAAAAGATCTCTGGCAA 

GCCTGATCACGTCTATAAGAACGTGTTCAGCGGTTTCGCAGCAACTCTGGACGAGAACATGGTCCGTGTACT 

GCGTGCTCATCCAGACGTTGAATACATCGAACAGGACGCTGTGGTTACTATCAACGCGGCACAGACTAACGC 

ACCTTGGGGTCTGGCACGTATTTCTTCTACTTCCCCGGGTACGTCTACTTACTACTACGACGAGTCTGCCGG 

TCAAGGTTCTTGCGTTTACGTGATCGATACGGGCATCGAGGCTTCTCATCCTGAGTTTGAAGGCCGTGCACA 

AATGGTGAAGACCTACTACTACTCTTCCCGTGACGGTAATGGTCACGGTACTCATTGCGCAGGTACTGTTGG 

TAGCCGTACCTACGGTGTTGCTAAGAAAACGCAACTGTTCGGCGTTAAAGTGCTGGACGACAACGGTTCTGG 

TCAGTACTCCACCATTATCGCGGGTATGGATTTCGTAGCGAGCGATAAAAACAACCGCAACTGCCCGAAAGG 

TGTTGTGGCTTCTCTGTCTCTGGGTGGTGGTTACTCCTCTTCTGTTAACAGCGCAGCTGCACGTCTGCAATC 

TTCCGGTGTCATGGTCGCAGTAGCAGCTGGTAACAATAACGCTGATGCACGCAACTACTCTCCTGCTAGCGA 

GCCTTCTGTTTGCACCGTGGGTGCATCTGATCGTTATGATCGTCGTAGCTCCTTCAGCAACTATGGTTCCGT 

CCTGGATATCTTCGGCCCTGGTACTTCTATCCTGTCTACCTGGATTGGCGGTAGCACTCGTTCCATTTCCGG 

TACGAGCATGGCTACTCCACATGTTGCTGGTCTGGCAGCATACCTGATGACCCTGGGTAAGACCACTGCTGC 

ATCCGCTTGTCGTTACATCGCGGATACTGCGAACAAAGGCGATCTGTCTAACATCCCGTTCGGCACCGTTAA 

TCTGCTGGCATACAACAACTATCAGGCT 

Eigure  2.  DNA  sequence  encoding  proteinase  K  optimized  for  expression  in  E  coli. 

After  eonstrueting  the  gene  as  shown  above,  we  plaeed  it  under  eontrol  of  the 
arabinose  promoter.  E  coli  transformed  with  the  resulting  eonstruet  produeed  mature 
aetive  proteinase  K  seereted  into  the  periplasm.  This  proteinase  K  ean  then  be  purified 
using  eommereially  available  niekel-NTA  affinity  matriees.  Thus  we  have  suecessfully 
eonstrueted  a  produetion  system  suitable  for  the  analysis  of  proteinase  K  variants. 

2,  Creation  of  functional  proteinase  K  variants 

To  engineer  into  proteinase  K  the  properties  that  will  be  required  for  it  to  usefully 
degrade  polylaetie  aeid,  we  are  adopting  an  approaeh  in  whieh  small  numbers  of  variants 
are  synthesized  and  tested;  the  relationship  between  sequenee  and  aetivity  is  then 
modeled  mathematieally  and  used  to  design  variants  that  are  further  improved.  This 
approaeh  was  ehosen  beeause  some  of  the  parameters  that  are  important  in  this  projeet, 
partieularly  the  range  of  oligomers  produeed  by  the  proteinase,  are  not  readily  amenable 
to  high-throughput  sereening.  To  obtain  meaningful  polylaetate  degradation  data  requires 
growing  about  1  liter  of  eells;  feasible  for  tens  or  even  a  hundred  variants,  but  eertainly 
not  for  the  thousands  that  are  generally  needed  in  eaeh  round  for  direeted  evolution 
methods. 
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To  obtain  a  sequence  activity  relationship,  it  is  first  necessary  to  create  a  set  of 
variants  in  which  each  mutation  occurs  more  than  once  and  in  combination  with  different 
mutations.  These  combinations  of  sequences  and  their  associated  activities  are  then 
amenable  to  analysis  by  mathematical  tools  developed  in  other  fields  of  engineering. 

We  used  several  criteria  to  select  positions  within  the  proteinase  K  sequence  for 
modification.  These  included  work  such  as  structural  studies  published  in  the  scientific 
literature,  changes  that  are  likely  to  be  acceptable  based  on  empirically-derived  amino 
acid  substitution  matrices,  phylogenetic  analysis  of  homologs  and  principal  component 
analysis  of  the  sequences  of  homologs  with  known  desirable  properties,  particularly 
thermostability.  The  first  24  selected  changes  are  shown  in  Table  1. 

As  an  initial  set  of  variants  we  designed  and  synthesized  24  genes,  each  of  which 
incorporated  6  of  these  changes.  We  also  ensured  that  each  change  was  represented  in  6 
of  the  genes.  We  used  casein  as  a  sensitive  substrate  simply  to  assess  whether  or  not  the 
variant  proteinase  K  proteins  were  at  all  active.  From  our  initial  set  we  obtained  3 
variants  with  some  activity,  two  of  which  were  more  active  degraders  of  casein  than  the 
original  protein. 


Variation 

N95C 

P97S 

SI  OVD 

S123A 

1 132  V 

E138A 

M145F 

Y151A 

VI 671 

L180I 

Y194S 

A199S 

K208H 

A236V 

R237N 

P265S 

V267I 

S273T 

G293A 

L299C 

131  OK 

K332R 

S337N 

P355S 


Primary  reason  for  inclusion 

Structural  stability  at  higher  temperature:  from  published  literature 

P  to  S  for  flexibility  and  structural  perturbabtion 

Seen  in  active  homologs 

Found  in  the  thermostable  consensus  sequence 

Substitution  matrix-derived  conservative  change  (controlled  perturbation) 

Seen  in  experiments  in  the  literature 

Seen  in  experiments  to  improve  thermostability 

Seen  in  experiments  to  improve  thermostability 

Substitution  matrix-derived  conservative  change  (controlled  perturbation) 
Substitution  matrix-derived  conservative  change  (controlled  perturbation) 
Varaiation  observed  in  highly  active  clone 

Substitution  matrix-derived  conservative  change  (controlled  perturbation) 
PCA  modelling  of  thermostable  homologs  collected  from  GenBank. 

PCA  modelling  of  thermostable  homologs  collected  from  GenBank. 

From  experiments  to  improve  thermostability  (in  literature) 

P  to  S  for  flexibility  and  structural  perturbabtion 

Substitution  matrix-derived  conservative  change  (controlled  perturbation) 
Multiple  sources  identify  this  change,  (thermostability  and  other) 
Thermostability  considerations  (observed  in  thermitases) 

Could  form  a  disulphide  bridge  with  N95C  (from  literature) 

From  structural  studies 

Thermostability  considerations  (observed  in  thermitases) 

Thermostability  considerations  (observed  in  thermitases) 

P  to  S  for  flexibility  and  structural  perturbabtion 


Table  1.  The  first  24  positions  in  proteinase  K  chosen  for  variation. 
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NS# 


25  34  50  55  M  97 

H  S  N  S  C  S 


107  123  132  138  145  151 
D  A  V  A  FA 


167  180  194  199 

I  I  S  S 


208  236  237  265  267  273  293  299 

H  V  N  S  I  T  A  C 


310  332  337  355 

K  R  N  S 


362 

M 


NS1 

NS9 

NS10 

NS12 

NS13 

NS14 

NS15 

NS16 

NS18 

NS19 

NS20 

NS21 

NS22 

NS23 

NS24 

NS25 

NS26 

NS27 

NS28 

NS29 

NS30 

NS31 

NS32 

NS33 

NS34 

NS35 

NS36 

NS37 

NS38 

NS39 

NS40 

NS41 

NS42 

NS43 

NS44 


Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 
H  A 

Y  A 

Y  S 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 


D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
N  N 
D  N 
D  N 
D  N 
D  S 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 


N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
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N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 
N  P 


S 

S 

S 

S 

D 

S 

S 

S 

S 

S 

S 

S 

S 

S 

S 

S 

S 

S 

S 

D 

D 

S 

S 

D 

D 

S 

S 

S 

S 

S 

S 

D 

S 

S 


S  I  E  M 
S  I  E  M 
A  I  E  M 
S  V  E  F 
S  I  E  M 
S  I  E  M 
S  I  E  M 
SIAM 
S  I  E  M 
S  I  E  M 
S  I  E  M 
S  I  E  M 
S  I  E  M 
S  I  E  M 
S  I  E  M 
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V 

V 
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V 

V 
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V 
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V 

V 

I 

V 

V 
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Table  2.  Sequence  variations  in  active  proteinase  K  variants 

Using  this  information  we  designed  and  synthesized  an  additional  24  genes,  8  of 
whieh  were  designed  to  test  the  amino  aeids  that  we  had  not  seen  represented  in  any  of 
the  aetive  variants,  the  remaining  16  were  designed  to  test  different  eombinations  of 
amino  aeid 

ehanges  that  appeared  to  allow  some  proteinase  activity.  The  combinations  of  amino  acid 
changes  that  resulted  in  enzymes  with  some  activity  in  the  degradation  of  casein  are 
shown  in  Table  2.  Combinations  of  amino  acids  that  yielded  proteinase  K  variants  with 
no  apparent  protease  activity  are  shown  in  Table  3. 


Final  Technical  Report  -  Research  Agreement  No,  DAAD19-03-1-0091 


25  34  50  55  95  97 


Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 

Y  A 


D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 
D  N 


C  P 
N  S 
N  P 
N  S 
N  P 
C  P 
C  P 
N  P 
N  S 
C  P 
N  P 
N  S 
N  P 
N  S 
C  P 
N  P 
N  P 
N  P 
N  P 
N  S 
C  P 
C  P 
N  S 
N  S 
N  P 
N  P 
N  P 
N  P 


107  123  132  138  145  151 


S  S 

s  s 

D  S 

D  S 

S  A 

S  S 

S  S 

S  S 

S  A 

D  A 

S  S 

S  S 

S  S 

S  S 

D  S 

S  S 

S  A 

S  A 

D  S 

S  S 

S  S 

S  S 

S  S 

S  S 

S  S 

D  A 

S  S 

S  S 


I  E 
I  A 

V  E 
I  E 
I  A 
I  E 

V  A 
I  E 
I  E 
I  E 
I  A 
I  E 

V  E 
I  E 
I  E 
I  E 

V  A 
I  E 

V  A 
I  E 
I  E 
I  E 
I  E 
I  E 
I  E 
I  E 
I  E 
I  E 


F  Y 
M  Y 
M  Y 
M  Y 
M  Y 
M  Y 
F  Y 
M  Y 
M  Y 
M  Y 
M  A 
F  Y 
M  Y 
M  A 
M  Y 
F  Y 
M  Y 
M  Y 
F  Y 
M  Y 
M  A 
M  Y 
M  Y 
M  Y 
M  Y 
F  Y 
F  Y 
F  A 


167 

I 

V 

V 

V 

V 

V 
I 

V 

V 

V 

I 

V 

V 

V 

I 

V 

V 

V 

V 

V 
I 

V 

V 

V 

V 

V 

V 

V 


180  194  199  208  236  237  265  267 


L  Y 
I  S 
L  Y 
I  Y 
L  Y 
L  S 
L  Y 
L  Y 
L  Y 
I  S 
L  Y 
L  Y 
L  Y 
L  Y 
I  Y 
L  Y 
L  S 
L  Y 
L  Y 
I  S 
L  Y 
L  Y 
L  Y 
L  S 
L  Y 
L  Y 
I  Y 
L  Y 


S  K 
A  K 
A  H 
A  K 
S  H 
A  K 
A  H 
A  K 
A  K 
A  K 
S  H 
A  K 
A  K 
S  K 
A  K 
A  K 
A  H 
A  H 
A  K 
S  K 
A  K 
A  K 
A  K 
A  K 
A  K 
A  K 
A  K 
S  K 


A  N 

V  R 
A  R 

V  N 
A  R 
A  R 
A  R 

V  N 
A  R 
A  R 
A  R 
A  N 

V  R 

V  R 
A  R 
A  N 
A  R 

V  R 
A  R 
A  R 
A  N 
A  R 
A  R 
A  R 

V  R 
A  R 
A  R 
A  R 


P  V 
P  I 
S  V 
P  V 
S  V 
P  I 
P  V 
P  V 
S  V 
P  V 
P  V 
P  V 
S  I 
P  V 
P  V 
S  I 
P  V 
P  I 
P  V 
S  V 
P  V 
P  V 
P  V 
P  V 
P  V 
P  V 
P  I 
P  V 


Table  3.  Sequence  variations  in  inactive  proteinase  K  variants 
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3.  Use  of  preliminary  screens 

The  aim  of  this  project  is  to  optimize  proteinase  K  for  the  degradation  of  polylactic 
acid.  All  of  the  variants  shown  in  Table  2  have  now  been  transferred  to  the  Gross  lab  for 
analysis  of  their  polylactic  acid  degrading  properties.  Because  some  of  the  properties  of 
interest  may  potentially  be  measured  independently  we  have  used  preliminary  screens  to 
test  the  sequence-function  modeling  approach  on  which  the  project  is  based. 

We  have  chosen  two  preliminary  screens.  The  first  of  these  is  the  ability  of  E  coli 
strains  carrying  plasmids  encoding  proteinase  K  variants  to  hydrolyze  casein.  Casein  is  a 
protein  in  skimmed  milk  whose  incorporation  into  the  media  on  which  E  coli  grows 
causes  a  cloudy  appearance  to  the  plate.  Hydrolysis  of  casein  results  in  a  clear  zone 
surrounding  the  colony.  We  use  this  screen  as  a  crude  test  for  proteinase  activity. 

The  second  preliminary  screen  that  we  have  used  is  the  cleavage  of  p-mixo  anilide 
peptides,  which  results  in  an  increase  in  absorbance  at  405  nm.  These  substrates  are  not 
intended  as  a  surrogate  for  activity  of  proteinase  K  towards  polylactic  acid.  Indeed  we 
see  clear  differences  in  the  activity  of  some  of  our  variants  towards  casein  (a  large  and 
heterogeneous  substrate)  and  p-mixo  anilide  tripeptides.  Some  variants  prefer  casein  and 
others  prefer  the  small  peptides.  This  simply  emphasizes  the  importance  of  screening 
variants  using  the  substrate  of  interest,  in  this  case  polylactic  acid.  However,  we  are  not 
only  interested  in  the  substrate  preferences  of  the  proteinase  K,  we  are  also  interested  in 
modifying  its  physical  properties.  Specifically  we  would  like  to  increase  the  tolerance  of 
proteinase  K  to  heat  treatment,  in  order  to  allow  its  incorporation  into  the  polylactic  acid 
during  manufacture.  We  would  also  like  to  increase  the  activity  of  proteinase  K  at  low 
pH,  so  that  it  can  function  in  the  presence  of  the  lactic  acid  that  will  be  produced  during 
its  degradation  of  polylactic  acid. 


4.  Increase  of  acid  tolerance  of  proteinase  K 

We  tested  all  of  the  variants  shown  in  Table  2  for  their  activity  towards  a  p-nitro 
anilide  substrate  at  three  different  pHs  (4.5,  5.5  and  7.5).  Our  aim  was  to  see  whether  the 
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variations  that  we  had  introduced  were  suitable  for  modifying  the  physical  properties  of 
the  enzyme. 

Figure  3  shows  that  four  of  the  variants  synthesized  have  3-4  times  the  activity  of 
the  wild  type  enzyme  at  pH4.5,  compared  with  less  than  2-fold  improvement  over  wild 
type  at  pH7.5.  This  shows  that  we  have  identified  and  combined  some  residues  that  shift 
the  pH  profile  of  proteinase  K.  We  may  therefore  anticipate  that  we  will  be  able  to 
increase  the  activity  of  proteinase  K  for  the  degradation  of  polylactic  acid,  specifically  for 
improving  its  tolerance  to  the  low  pH  resulting  from  lactic  acid  formation. 


Figure  3.  Activity  of  four  variants  and  wild  type  proteinase  K  towards  AAPL-p-nitro 
anilide  at  three  different  pHs.  The  same  amount  of  purified  proteainse  K  enzyme 
(approximately  20  ng)  was  incubated  with  AAPL-/7-nitro  anilide  at  pH  4.5  (1),  pH  5.5  (2) 
and  pH  7.5  (3).  Activity  was  measured  as  an  increase  in  the  absorbance  at  405  nm.  Four 
of  the  variants  that  possessed  increased  activity  at  pH  7.5  (in  descending  order  of  activity: 
NS40  in  turquoise,  NS23  in  pink,  NS15  in  blue  and  NS35  in  yellow  compared  with  wild 
type  in  purple)  also  had  broadened  pH  activity  profiles. 
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5,  Increase  of  thermostability  of  proteinase  K 

We  tested  all  of  the  variants  shown  in  Table  2  for  their  activity  towards  a  p-nitro 
anilide  substrate  following  a  5  minute  treatment  of  the  purified  enzyme  at  65°C.  Our  aim 
was  to  see  whether  the  variations  that  we  had  introduced  were  suitable  for  modifying  the 
thermal  tolerance  of  the  enzyme,  a  necessary  property  for  the  proteinase  to  withstand  the 
manufacturing  process. 


Figure  4.  Activity  of  four  variants  and  wild  type  proteinase  K  towards  AAPL-p-nitro 
anilide  following  heat  treatment.  Duplicate  samples  of  purified  proteinase  K  variants 
were  either  incubated  on  ice  for  5  minutes  or  incubated  at  65°C  for  five  minutes.  The 
enzymes  were  then  incubated  with  with  AAPL-p-nitro  anilide  at  pH  7.5.  Activity  was 
measured  by  an  increase  in  absorbance  at  405  nm.  The  activity  of  the  variant  following 
heat  treatment  was  divided  by  its  activity  without  heat  treatment  to  calculate  the  stability 
ratio,  shown  on  the  y-axis.  Variants  were  NS40  (1),  NS32  (2),  NS19  (3)  and  wild  type 
(4). 


Figure  4  shows  that  three  of  the  variants  synthesized  are  more  stable  to  heat 
treatment  than  the  wild  type  enzyme.  This  shows  that  we  have  identified  and  combined 
some  residues  that  improve  the  thermal  tolerance  of  proteinase  K.  We  may  therefore 
anticipate  that  we  will  be  able  to  increase  the  activity  of  proteinase  K  following  its 
incorporation  into  polylactate  during  manufacture. 

Performance  of  variants  at  the  optimal  pH  of  Proteinase  K. 

Proteinase  K  has  an  optimal  pH  of  8.6  under  native  conditions.  To  identify  key  amino 
acids  that  improve  the  performance  of  this  enzyme  under  optimal  conditions.  Poly  lactic 
acid  was  hydrolysed  with  wild  type  and  mutant  enzymes  at  37  C  for  48  hours.  At  the  end 
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of  the  time  period  the  mixture  was  filtered  and  the  pH  was  reeorded.  The  filtrate  was  then 
analysed  by  LC-MS  using  a  water  acetonitrile  mobile  phase.  The  product  profile  was 
evaluated  using  lithium  lactate  as  a  monomer  standard,  lactic  acid  lactide  as  a  dimer 
standard  and  commercial  lactic  acid  (mixture  of  monomer,  dimer  and  trimer)  as  a  trimer 
standard.  The  yield  of  lactic  acid  monomer,  dimer  (LL  and  LD)  and  trimer  was 
determined  in  mmoles  and  the  background  activity  was  subtracted  using  a  buffer  control. 
In  order  to  observe  the  improvement  of  various  mutations,  the  yields  were  expressed 
relative  to  that  of  NS9  (the  wild  type  enzyme). 
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□  Lactic  Acid  Dimer 


Variants 


Figure  5.  Relative  yield  of  Lactic  acid  dimer  (with  reference  to  the  yield  of  the  wild  type 
enzyme,  NS9). 


The  analysis  of  the  hydrolysis  products  at  pH  8.6  indicated  that  a  significant 
improvement  had  been  achieved  with  variant  NS  40  having  a  near  5 -fold  improvement  in 
activity  (See  Figure  5).  While  a  number  of  variants  had  lower  yields  than  the  wild  type 
enzyme,  a  large  group  had  better  yields  than  the  wild  type  enzyme.  This  clearly  indicated 
that  the  enzyme  improvement  strategy  was  useful  in  developing  better  candidates. 


Altered  activity  at  lower  pH  conditions. 

As  the  optimal  pH  of  Proteinase  K  is  8.6,  this  enzyme  is  quickly  inhibited  at  lower  pH 
values.  We  set  out  to  screen  the  variants  at  pH  6.5  and  5.5  to  identify  those  that  would  not 
be  inhibited  by  the  accumulation  of  an  acidic  product  such  as  lactic  acid  or  its  oligomers. 
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Performance  of  Variants  at  various  pH  conditions 


□  pH  5.5 
n  pH  6.5 

□  pH  8.5 


(£.  . . 

9  32  40  41  48  55  56  57  58  81  83  101 

Variant  Number 

Figure  6.  Relative  yield  of  Lactic  acid  dimer  (with  reference  to  the  yield  of  the  wild  type 
enzyme,  NS9)  at  pH  5.5,  6.5  and  8.6. 

The  yield  of  lactic  acid  dimer  (LL  and  LD)  was  determined  in  mmoles  and  the 
background  activity  was  subtracted  using  a  buffer  control.  In  order  to  observe  the 
improvement  of  various  mutations,  the  yields  were  expressed  relative  to  that  of  NS9  (the 
wild  type  enzyme).  As  there  was  no  monomer  detected  at  pH  5.5,  the  performance  of  the 
variants  was  evaluated  by  the  relative  yields  of  the  lactic  acid  dimer  (LL  and  LD).  A 
number  of  variants  were  found  to  perform  very  well  at  pH  5.5.  The  highlight  of  this 
screen  is  variant  NS  58  which  has  a  12  fold  improvement  over  the  wild  type  and  out 
performs  even  commercial  preparations  of  Proteinase  K  purchased  from  Sigma  chemical 
Co  St  Louis. 

Thermal  stability 

Industrial  and  field  applications  of  biocatalysists  call  for  a  very  stable  enzyme.  To  screen 
our  variants  for  heat-stability,  we  heat-denatured  the  enzyme  preparations  at  60  °C  for  15 
minutes  prior  to  hydrolysis  of  PLA  at  37  °C  and  pH  8.6.  All  the  variants  were  screened 
under  these  conditions  to  identify  those  that  would  be  thermostable. 
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Effect  of  Heat  T reatment  on  product  profile 
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Figure  7.  Yield  of  Lactic  acid  Oligomers  (monomer,  dimer  and  trimer)  in  mmoles  at  pH 
8.6  after  a  15  minute  heat  denaturation  at  60  °C. 


While  most  enzymes  succumbed  to  heat  denaturation  and  lost  all  their  activity  as  in  the 
case  of  the  wild  type  enzyme  (see  Figure  7),  certain  variants  remained  active  after  heat- 
denaturation.  The  most  striking  variant  in  this  study  was  variant  NS  27  which  retained  67 
%  of  its  activity  after  heat-denaturation  (see  Figure  8). 
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Figure  8.  Residual  activity  after  heat  denaturation:  Ratio  of  the  yield  of  dimer  (LL  and 
LD)  after  heat  denaturation  to  that  of  the  untreated  enzyme. 
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Figure  9.  Profile  of  lactic  acid  oligomeric  products  (monomer,  dimer  and  trimer)  relative 
to  the  wild  type  enzyme  (NS  9). 
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A  key  feat  in  protein  engineering  is  the  ability  to  alter  the  product  profile  of  an  enzyme. 
Such  an  approach  can  yield  novel  products  and  can  be  used  to  tailor  the  reaction  towards 
specific  products.  We  have  analyzed  the  products  of  all  our  variants  at  pH  8.6  and  looked 
for  variants  which  differed  significantly  from  the  wild  type  enzyme.  While  a  majority  of 
the  variants  had  increased  yields  of  the  lactic  acid  trimer  (see  Figure  9)  such  as  NS  27,  33 
40,  41  and  84,  NS  63  though  lower  in  total  activity  had  and  increased  yield  of  lactic  acid 
monomer.  This  result  indicated  a  significant  change  in  product  profile  from  the  general 
trend. 


Summary 

Analysis  of  variant  structures  resulting  in  the  above  improvements  showed  that  amino 
acid  substitutions  important  for  one  function  are  different  from  those  that  are  important  to 
improve  another.  This  emphasizes  the  importance  of  using  low-throughput,  high 
information-content  screens.  It  is  well  known  that  high-throughput  surrogate  screens 
increase  the  noise  and  possibility  of  false-positives.  This  leads  to  lost  time  as  information 
is  wrongly  interpreted  and  used  in  the  design  of  next  round  variant  pools.  Equally 
dangerous  is  the  omission  of  variants  that  are  not  recognizing  for  the  beneficial  changes 
they  provide.  These  difficulties  are  largely  overcome  by  this  new  protein  engineering 
strategy  for  which  small  variant  sets  become  the  focus  of  careful  and  comprehensive 
studies  to  characterize  variant  activities.  Furthermore,  8  different  algorithms  for  sequence 
activity  modeling  have  already  been  tested  for  sequence  activity  modeling  and  are 
generally  proving  to  be  robust  and  capable  of  predicting  optimal  sequences  as  well  as 
designing  information-rich  variants. 

Conclusion 

Directed  evolution  approaches  must  screen  thousands  of  variants  per  cycle  to  obtain 
improvement  factors  that  normally  are  1.5-  to  2-fold.  Such  large  numbers  of  variants 
makes  it  impractical  to  screen  multiple  enzyme  performance  characteristics.  Furthermore, 
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the  results  from  high  through-put  sereening  are  often  are  tainted  by  false  positives  and 
negatives.  The  projeet  undertaken  to  engineer  PL  A  degrading  enzymes  needed  low 
throughput  assays  that  eould  aeeurately  seleet  variants  that  offer  improvements  in  a  wide 
range  of  enzyme  charaeteristies  ineluding  the  structure  of  products  formed.  This  required 
a  new  technological  approach  to  protein  engineering.  Using  an  integrated  substitution 
choice  method,  and  sequence-activity  models,  we  have  obtained  a  dramatic  shift  in 
enzyme  activity,  while  screening  only  a  total  of  100  variants.  The  small  number  of 
variants  required  allowed  us  to  accurately  assess  the  distribution  of  products  formed  and 
enzyme  behavior  while  varying  different  physical  parameters  that  are  important  for  final 
process  design  such  as  temperature  and  pH. 


